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Abstract 

The theoretical base for consciousness, in particular an explanation of how consciousness is defined by the brain, has 
long been sought by science. We propose a partial theory of consciousness as relations defined by typical data. The theory is 
based on the idea that a brain state on its own is almost meaningless but in the context of the typical brain states, defined by 
the brain's structure, a particular brain state is highly structured by relations. The proposed theory can be applied and tested 

O' 

r\l , both theoretically and experimentally. Precisely how typical data determines relations is fully established using discrete 

vJ ■ mathematics. 
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rn '. 1 Introduction 

V^ ' In neuroscience the neural correlates of consciousness provide an important empirical base for consciousness but not a the- 
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qh. correlates of consciousness at some stage rely on obtaining information about a person's experience by asking them or by 
considering their sensory input. Subsequently a given experience can be associated with the aspects of a person's neuro- 
K^ ' logical state that are always observed for that experience. To exemplify the difference, compare Newtonian mechanics with 



cn 



c^ 



oretical one. To clarify, a theoretical base is a predictive theory that is free from empirical methodology whilst usually 
appealing to, and revealing aspects of, the innate mathematical properties of what is being studied. In contrast the neural 



astronomical predictions based on astronomical tables. Importantly it is expected that the neural correlates of consciousness 



CO ■ alone cannot provide a satisfactory explanation of consciousness since this would invoke some unknown agency that can 

rn 
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experience. Hence an important requirement of a theoretical base for consciousness is that it should avoid the use of any 



discover the external cause of a particular neurological state within the brain so as to associate that state with an appropriate 



prior knowledge of what stimulates the senses. We should expect the brain itself to fully define conscious experience all be 

/\( . it having been stimulated by the senses. To assess whether a particular theory meets this requirement we also need a clear 
H ■ 

notion of what consciousness is. Whilst consensus in this regard will be hard to come by, it can be argued that one funda- 
mental aspect of consciousness is the role played by relations such as those that define geometric content or the individuality 
of objects, their relationships and type such as visual or auditory. We therefore postulate that our conscious experience could 
largely be a mathematical structure defined by relations. In this case the principle underlying how the brain simultaneously 
defines all the required relations is needed. For example, since the part of conscious experience that correlates with the state 
of the primary visual cortex is of a metric space viewed from a particular position, we expect that the primary visual cortex 
ought to define relations between neurons, or other identifiable nodes, that result in a metric space. This paper proposes a 
theory that may satisfy these requirements whilst being theoretically and experimentally amenable to the scientific method. 
Of course the scientific literature does already include important contributions towards establishing a theoretical base for 
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consciousness. Perhaps the most prominent of these is the theory of consciousness as integrated information proposed by 
Giulio Tononi in 2008, see |[T|. Tononi had previously worked with Gerald Edelman the Nobel Prize-winning immunologist 
and subsequent neuroscientist. Together they wrote a book entitled A Universe of Consciousness, see (12), which provides 
significant scientific insight towards an account of consciousness. However, whilst the importance of relations is evident in 
their work, their emphasis does not suggest how the content of consciousness might be defined by the brain. A review of the 
book by Giorgio Ascoli, see ISJ, points out that the authors focus on the properties of the neural process such as integrated 
activity in the highly reentrant dynamic core, where the dynamic core is a large part of the thalamocortical system, and also 
on the properties of consciousness such as unity, privateness, coherence and informativeness. In Ascoli's view the book does 
not address the question of why a sensation corresponds to a specific state of the dynamic core as opposed to another one. 
In this respect I support the view that the relations defined by the brain are important. It can be seen from [4 1 that the brain 
defines relationships between certain patterns of activity occurring in various sensory regions of the brain. For example, for 
a given pattern of activity in the visual cortex we can ask whether it is typical for another particular pattern of activity to be 
present at the same time in the auditory cortex. If so then the given pattern is related to the latter pattern. Consider how such 
a relationship might be contributing to the experience of seeing a picture of Albert Einstein whilst hearing the name Albert 
as opposed to hearing the noun apple. For now the experience associated with a particular pattern of activity may be known 
from the neural correlates of consciousness. However the relationships that the brain defines between patterns allows more 
to be derived about a person's experience than that associated to the patterns in the sensory regions of the brain alone. Hence 
we should try to move down from this higher semantic level replacing neural correlates of consciousness with derivations 
involving relations as we go if possible. I do not however doubt the enduring relevance and importance of Edelman and 
Tononi's work such is the knowledge and insight it provides. 

The mathematics in this paper is straightforward involving binary relations, matrix tables and a small amount of graph theory. 
The relevance of such mathematics for the brain has been noticed before particularly in the study of anatomical and functional 
connectivity, see H, which is a different, and yet associated, purpose to that of this paper concerning consciousness. 
We will start by considering the following properties of the brain that are available for consciousness, noting that the list is 
not intended to be exhaustive,: 

(i) the brain has a large number of identifiable nodes by which we mean neurons in this paper, but more generally possibly 
cortical columns; 

(ii) the brain is capable of a large number of states where a brain state is a possible and probable aggregate state of all the 
brain's nodes; 

(iii) to some extent there is some type of ordering on the collection of brain states since the brain has some of the properties 
of an endofunction, all be it under perturbation by the senses. 



In this paper we will mainly be considering (i) and (ii) of the above. In this respect Definition II. II will be useful where, when 
applied to the brain, the elements of S are the neurons. Merely to keep things simple we will mainly restrict our selves to 
nodes that have a two state repertoire. 

Definition 1.1. Let 5 be a nonempty finite set, n := #S. Then a set, for an arbitrary index label /, 

Si:= {{a,fi{a)) : a£S, ft : S -^ {0,1}}, where /)■ is a map, (1) 

will be called a data element for S. The set of all data elements for S is denoted £1$ so that #0.$ = 2". If a particular subset 
T C1Q.S has been associated with S then we will call T the typical data for S. Further in such cases we will refer to S as the 
carrier set. An element Si G T will be called a typical data element. 

Before we consider the brain the following motivating example will be useful. 

Example 1.1. We will consider what could appropriately be called: The definitive player problem. The purpose of this simple 
example is to introduce the idea that typical data can define a structure on a carrier set which in tern gives an interpretation of 
each typical data element. Consider a library of compact discs and suppose that these discs have all been made to a generic 
template in the sense that the locations of the bits, either or 1, are the same for all discs. Further suppose that the discs all 
produce highly structured output on some standard player which always reads off the bits in the same order relative to the 
generic template. In the language of Definition II. li the generic template is the carrier set S and the library is the typical data 
T . Now suppose we have two of these discs 81,82 ^ T where, on the standard player, 81 is Beethoven and ^2 is Elgar. On 
some nonstandard player where the order in which the bits are read is different to the standard player it could be that ^i is 
Mozart and ^2 is something else, possibly white noise, depending on the reading order. Therefore a single disk on its own is 
almost meaningless. However, by requiring highly structured output, each disc 8i in the library defines a subset of the set of 
all players. By taking the intersection of all these subsets we will be left with relatively few players including the standard 
player If the hbrary is large enough and we could measure how structured an output is then the typical data might determine 
a definitive player and hence, in the context of the library, 81 is Beethoven and ^2 is Elgar. 

The definitive player in this example is essentially a relation between the bits on the generic disc template, i.e. the carrier set, 
such that almost every bit is related to two other bits so as to form a sequence up to a choice of direction. When a disc from 
the library is played on the definitive player the output has relatively few abrupt transitions in output frequency and so there 
is some similarity between the relation on the carrier set and what is written on the discs. 

We finish this example by mentioning that there are plenty of different choices of typical data, i.e. libraries, available and in 
particular many more than there are players. If there are n bit locations on the generic disc template, so that #8 = n, then there 
are n\ different players by which we mean «! different sequences of these bit locations. Further the number of different discs 
that can be written is 2", that is #0.^ = 2". Therefore the number of different subsets of Q.s is 2^ and it is straightforward to 
show by induction that 2^ > n! for all « G N. 



In the next section we will see that the appropriate relation to put on the carrier set, if unique, is explicitly determined by 
the typical data itself. Suppose in Example 1.1 that instead of the data points on the discs having a two state repertoire, bits, 
there were as many states as output frequencies or that the nodes on the generic disc template are the bytes instead of the bits. 
Then the theory in the next section would apply to Example ll.ll and there would not be a problem concerning how to measure 
the quantity of structure of an output. Moreover towards the end of this paper we will argue that the theory presented solves 
what is known as the binding problem. 

2 Relations defined by typical data 

We will refer to Table [T]below several times in this section. In Table[T|the carrier set has four elements, S = {a,b,c,d}. 
There are 24 different sequences, i.e. one dimensional arrangements, of the elements of S and these appear in the column 
headings of the table. There are 16 different binary data elements for S and each row of Table [T] gives a particular data 
element under the 24 different one dimensional arrangements. Now let T := {85,810,813} be the set of typical data for 8. 
Let us try to arrange the elements of 5 in a way that achieves something similar to that exemplified by the definitive player 
problem. We can consider which sequence, or other arrangement, of the elements of the carrier set gives the most structured, 
transition free, interpretation of the typical data elements. The sequence acdb and its reverse bdca satisfy this requirement 
since under these arrangements, for each typical data element, the zeros and ones are unmixed. In the sequel we introduce 
relations to show how the typical data determines the structure on the carrier set. Since this structure is given by a symmetric 
relation, as opposed to an antisymmetric relation in the case of total orders, the problem of whether T gives acdb or bdca as 
the definitive arrangement of the carrier set will be solved. We begin with the following standard definitions which will be 
particularly useful here. 

Definition 2.1. Let 5 be a nonempty set. A binary relation on 5 is a subset R C 8^ where 8^ :~ {ia,b) : a G 8,b G 8}. For 
a,b^8we say that a is R-related to b, and write aRb, precisely when {a,b) G R. We say that R is: 

(i) reflexive if (a, a) G /? for all a G 5; 

(ii) symmetric if for every {a,b) G /? we also have {b,a) G R; 

(iii) antisymmetric if for every pair of distinct elements a, fe G 5 at most one of {a,b) and [b, a) is an element ofR; 

(iv) transitive if for every triple of elements a,b,c G 8 with {a,b) G R and {b,c) G Rwe also have (a,c) G R; 

(v) an equivalence relation if R is reflexive, symmetric and transitive. 

There is a strong connection between the theory of relations on a set and graph theory. In the following definition we use 
some graph theory terminology. 
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Definition 2.2. Let 5 be a nonempty finite set and R (- S^ a symmetric relation on 5'. For a,b & S a walk from a to b, if one 
exists, is a finite sequence (^jOieji, ...,„}, « G N is odd, sucii tliat: 

(wl) k\—a and k^ = b; 

(w2) we iiave ki e 5 if / is odd and ki &R'\f i is even; 

(w3) for / even we have ki = (A:,_ i , A:,+ 1 ) . 

For a, /? e 5 let Ka^t, denote the set of all walks from a to b. The R-distance between two elements a,b E S is 

, / ,N < min{^: (^;)iG{i,-.,«} e^«,i} if ^„,i is nonempty 
dR{a,b):={ (2) 

if K^b^Ql. 



Lemma 2.1. Since R is symmetric, the /^-distance d« defined in Definition l2.2l is either a metric or an extended metric on S. 
By extended metric we mean a metric that takes non-negative values on the extended real line, [— oo^oo]. 

Proof. One checks the four standard metric axioms. D 

Remark 2.1. Let 5 be a nonempty finite set and n := #S. Then S^ is the equivalence relation on S with just one equivalence 
class. Whilst the graph diagram of a graph need not be unique, by applying uniformity principals for the lengths of edges and 
angles between adjacent edges, many graph diagrams are unique. For example, the graph diagram of S with the relation S^ is 
given by the edges and vertices, nodes, of the « — 1 dimensional regular simplex e.g. for n—4 the simplex is a tetrahedron. 

In the sequel the following metric will also be useful. 

Lemma 2.2. Let 5 be a nonempty finite set and let 2^ be the set of all binary relations on S, noting that this is the power set 
of 52. Then 

dA{R,R'):=#{RAR'), R,R' e2^\ (3) 

is a metric on 2^ where RAR' :— {RUR')\{RnR') is the symmetric difference of R and R'. We call dA the symmetric 
difference metric on 2^^ . 

Proof. Standard for S^ finite. D 

The following example shows how typical data determines a structure on the carrier set. 

Example 2.1. With reference to Table [T] again let T — {85,810, Sij,} be the set of typical data for 8 = {a,b,c,d}. With 
reference to Definition ll.il we note that each typical data element 5, = {(a, /;(«)) : a G 5, f : 8 ^ {0,1}} defines an 
equivalence relation on 8 of the form 

R{8t):^{ia,b):a,be8, fi{a)^fib)}. (4) 



Hence from T we obtain the relation tables inFigure[T] Note that for numerical cell values use 1 — \fi{a) ~ fi{b)\ for a,b G S. 
Now we aggregate the relation tables in Figure [Tjinto a single weighted relation table Rj by calculating the mean number of 
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Figure 1: The relation tables defined by the elements of T. 

dots per table cell. Hence for a,b E S, Rj shows the proportion of equivalence relations defined by the elements of T that 
have a related to b. The table Rj is shown in Figure|2] Now, for a threshold value of 0.5, we round the cell values of Rj such 
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Figure 2: The structure on the carrier set S determined by T. 



that values greater than 0.5 are rounded up to 1 and values less than or equal to 0.5 are rounded down to 0. This results in the 

relation Rg. We note that a relation obtained in this way will always be symmetric but in general it need not be transitive. In 

particular Rg is not transitive but since it is symmetric it defines a metric or an extended metric on S by Lemma IZTI Hence 

we will refer to S with Rs, defined by T, as the carrier space. 

The graph diagram of S with the relation Rg is given by Gs in Figure |2] Arguable Gs is one dimensional and we note that it 

agrees with our discussion at the beginning of Section|2]since being a non-directed graph. 

We note that as theory develops it might be useful to retain the weighted relation Rj instead of only working with Rs- In 

particular one can obtain a hierarchy of relations from Rj by varying the rounding threshold. However there are good reasons 

for choosing a rounding threshold of 0.5. In particular Rs is such that the mean of the distances between Rs and the elements 

R{Si) obtained from T is minimized, that is 



^ ^ dA(/?5,/?(5,)) -mini ^ Y. dA(-R,-R(5,)) :-Ris arelationon5 I 



(5) 



In general Rs need not be unique in this respect if the value 0.5 appears in the relation table for Rj. We will shortly relate Rs 



to something we will call float entropy which also supports a rounding threshold of 0.5. 

The following example uses typical data which defines a structure on the carrier set that is not one dimensional. 

Example 2.2. With reference to Table[T] let T' = {S(,,S^,Si(,} be the set of typical data for S' := S where S is the carrier set 
of Example 12. II Following the theory introduced in Example 12 . II gives the results presented in Figure |3] 
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Figure 3: The structure on the carrier set S' determined by T'. 

2.1 Float entropy 

In this short subsection we will discuss the notion of float entropy. Let 5 be a carrier set, T CQ.^ the typical data of S and R a 
relation on S. Suppose we consider T to be the set of possible messages that can be sent to a receiver. In standard information 
theory the receiver would also have a copy of T so that sending a message only involves sending enough information to 
identify the intended element. Instead of this suppose that the receiver only has a copy of S and R. For Sj ET if the relation 
R{Si) is relatively close to R with respect to dA then the number of bits that need to be sent to the receiver in order to specify 
Si will be relatively small. In this case Sj is highly compressible, carries little information and is highly structured relative 
to R. We summarize this situation by saying that 5, has low float entropy relative to R. The extreme case of minimum float 
entropy occurs when R{Si) = R which is possible if R is an equivalence relation. With reference to Definition 1 1.1 1 we can 
quantify float entropy relative to a given relation R as follows. 



fc{R,Si) := log2(#{5y e as : d^{R,R{Sj)) < d^{R,RiSi))}). 



(6) 



This is a measure in bits of the amount of information required to specify Si under the assumption that what is being specified 
ought to be highly structured relative to R. We can consider some values for examples 12. 11 and |2.2| Recall that in Example 
12. II we have T ~ {S5,Sio,Si3} and in Example l2.2l r^ = {56,^9, ^le}. For Rs from Example l2. ll we have fe{Rs,Sio) ~ 1 and 
fe(^5,5'5) — fe{Rs,Si3) = 2.58 to two decimal places whereas in contrast fe{Rs,Sg) — 4. We will denote the mean of the 
float entropies for the elements of T with respect to Rs by fe{Rs,T) and extend this notation to T' and Rs' from Example 
122] accordingly. Working to 2dp throughout gives fe{Rs,T) — 2.06 and fe{Rs',T') = 2.58 whereas fe{Rs,T') = 3.55 and 
fe{Rsi,T) = 3.87. Hence we see that the relations obtained by the method shown in the examples are, relative to their 



respective typical data, a good choice in order to minimize the mean float entropy. 

Now let S be the set of neurons of a brain and T the set of brain states where a brain state is a possible and probable aggregate 

state of all the brain's neurons. If we are trying to approximate T then ideally T will be selected such that, as a random 

variable restricted to T, the brain has a uniform distribution over T. Further ideally T should be large enough so that the 

probability of the brain being in a state that is close to at least one of the elements of T is high. Under these conditions we 

note, by Equation|5] that setting R := Rs is a good choice in order to minimize the expected float entropy. 

In the next section we will to some extent consider the possible relevance of the theory in Section|2]to the brain. We will also 

extend the theory to what we will call objects. 

3 The brain and relations between objects defined by typical data 

Although our theory is to be considered for typical data elements of the state of the whole brain, we begin this section by 
considering the relevance of the theory to the primary visual cortex, VI. Associating the retina with the unit disc of the 
complex plain and similarly embedding the flattened cortical sheet of VI into the complex plain, we note that the retino- 
cortical mapping to VI on a given side of the brain is approximately logarithmic and is therefore far from being an isometry, 
see 161 and I?]- Hence the geometry of VI cannot account for the perceived geometry of monocular vision. Furthermore, the 
right side of each retina is mapped to the right side of the brain whereas the left side of each retina is mapped to the left side of 
the brain. Hence the signals from a given retina go to two different brain areas. Despite this the perceived geometry produces 
a seamless isometric version of the image on the retina. Such facts underline the need for a theory such as that initiated in 
this paper since we need to explain how perceived geometry is defined by the brain. 

Let S be the set of neurons in VI. Further let a' and b' be two distinct points that are fixed relative to the eye in a person's 
field of view as depicted in Figured Let a be a neuron in VI that is stimulated by the retina when there is stimulation of the 





Figure 4: Two fixed points in a person's field of view. 



retina from a' . Similarly let bhe a neuron in VI that has the same relationship with b'. Consider the typical data T for VI. 
We note that abrupt transition lines between light and dark or regions of different color are relatively sparse in the field of 
view. In a somewhat simplified analysis, suppose that there are usually no more than n abrupt transition lines in the field of 



view. As depicted in Figure ID let / be the length of the line through a' and b' crossing the field of view and d the viewable 
distance between a' and b' . Suppose that all n transition lines intersect the line through a' and b' . Then the probability P„ that 
there is one or more transition lines between a' and b' is 



d_\" 
I 



(7) 



We note that lim^^^o^n ~ 0. Hence if d is small then a will be in the same state as b in the majority of the typical data 
elements of T. On the other hand if d is large then arguably a and b will rarely be in the same state. Therefore the relation 
Rs on S defined by T ought to correspond well with the structure of the field of view. This claim is supported below by the 
results of a study using digital photographs to test how well the theory establishes relative pixel positions. 
First though we note that evidence has been found for VI that supports the BCM version of Hebbian theory, see |I8] and ||9]. 
Hebbian theory implies that if a' and b' are close together then stimulation of a and stimulation of b from within VI ought 
to usually happen together. Therefore the typical data is typical of the states that V 1 can internally generate by itself. Hence 
V 1 defines Rs and by doing so it defines the interpretation of the current state of V 1 . Whilst this is the case in theory further 
investigation is required when the full complexity of the visual system is considered. 

Now a study was conducted using 105 digital photographs taken of everyday scenes using the same seven megapixel digital 
camera. A computer program centered a 5 x 5 grid of sampling points over each photograph and recorded to which brightness 
class each point belonged. Here the grid points are the elements of S whereas an element of T is given by the values obtained 
for one of the photographs so that #T = 105. Two parameters are involved the first being the grid point spacing in pixels of 
adjacent grid points and the second being the number of brightness classes used. The second parameter is therefore the node 
repertoire and, apart from the fact that the repertoire was not restricted to two, everything proceeded as per examples 12.11 
and l2.2l Results showed that Rs was close, with respect to dA, to the relation for the grid provided that the parameters used 
corresponded to a point on the curve in Figure |5] Now suppose we numerate the elements of T from 1 to 105 and calculate 
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Figure 5: Established parameter options. 
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Rs after the first n elements for n G {1,5, 10, 15, ■ ■ ■ , 105}. Figure |6] shows how the acquired relation converged toward the 
relation for the grid as n increased. The parameters used for Figure 0are indicated by the point p in FigureE] Further Figure|7] 



□ Distance given by ^d^ between the 

acquired relation and the relation of the grid. 

X Number of edges in the acquired graph 
diagram omitted by the grid. 

O Number of grid edges omitted in the 
acquired graph diagram. 




n 1 1 1 1 1 1 1 1 1 1 

50 100 
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Figure 6; Convergence to the relation for the grid. 



left, shows the graph diagram of the relation for the grid and, right, the edges given by the relation Rs for n = 105. Clearly 
convergence would be obtained for large enough #T. This works because whilst the content of the world around us is very 
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Figure 7: The grid edges compared with the edges given by Rs- 

varied it is nevertheless highly structured relative to the underlying geometry of the space. Brightness classes were used in 
the study so that the nodes, the grid points, would represent neurons in VI that respond to rod cells in the retina. We should 
note that the rod cells are arranged more in the form of a hexagonal lattice than a grid. Further it would be interesting to 
repeat this study with each grid point split into three separate nodes giving one for each cone cell type so that #5 = 75. The 
cone cells respond either to red, green or blue. The resulting relation Rs may suggest a solution to the binding problem for 
color perception. Finally we should consider what might determine the repertoire of a neuron. The brain itself should define 
this. For example, if a small change in the output frequency of a neuron has no affect on the system then with respect to the 
system the neuron's state is the same. Similarly if switching over the outputs of two different neurons would have no affect 
on the system then with respect to the system the neurons are in the same state. This last point is just a suggestion. Note 
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that such a definition of relative node state may resuh in the relation/? (5,) for 5, G T no longer being transitive. We will now 
move onto our discussion concerning objects. 

3.1 Relations between objects defined by typical data 

We start this subsection with a definition. 

Definition 3.1. Let 5 be a nonempty finite set with typical data T and the relation R^ defined on S by T . Let X be some other 
finite set with #X < #S. We say that 

Xj := {{a,Xj{a)) : a <eX, xy : X ^> {0, 1}}, with a relation /?x on Xj, (8) 

is an object ofS if there is some Sj £ T, Sj = {{a,fi(a)) : a E S, /; : 5 — ?• {0, 1}} with relation 

Rs, := {{{aj,{a)),{bj,ib))) : {a,b) e Rs}, (9) 

and an injective map Ajj : Xj — > Sj, given by Aji{{a,Xj{a))) :— {Xji{a),fi{Xji{a))) where Xji{a) G S, such that for all 
{a,Xj{a)),{b,Xj{b)) G Xj we have: 

(i) Xj{a) ^ fi{Xji{a)); 

(ii) {{a,Xj{a)), {b,Xj{b))) G Rxj if and only if {Aji{{a,Xj{a))),Aji{{b,Xj{b)))) G /?s,.. 

We say that the object Xj embeds into 5, and denote the set of all objects of S by ff. 

We will now show that typical data T defines a relation R^ on the set of objects of S as follows. For Xj G & let Tx '■= 
{Si : Xj embeds into 5, where 5, G T}. Note, by Definition l3.1l that Tf is not empty. Now the relation R^ is given by 

Rff := I (XjJk) : ^^^ > 0.5 where (X„F,) e i^H . (10) 

We note that in general Rff need not be symmetric or transitive and that it is the relation obtained by applying a rounding 
threshold of 0.5 to the weighted relation R^ given in Figure |8] Similar to the situation in Example 12. II one can obtain a 
totally ordered hierarchy of relations on ff by varying the rounding threshold applied to R^r- Turning our attention to the 
topic of float entropy that we began in Subsection |2.1| we note that if the receiver not only has a copy of S and Rs but also has 
a copy of Rff then the elements of T should be even more compressible and are even more structured relative to the relations 
available to the receiver Finally we note that the theory in this paper easily generalises to cases where the neurons, or other 
nodes, have more than a two state repertoire, that is we can allow fi to take more than two values in the definition of a data 
element 5, given in Definition ll.il In this case one also makes a similar adjustment to the definition of an object Xj of S. 
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Figure 8: The weighted relation table for R_^ on ff determined by T. 

4 Development, testing and conclusion 

There are different ways in which this theory can be developed. From a purely theoretical perspective it is interesting to 
establish the range of structures that can be defined by typical data comprised of comparable nodes noting for example 
that functions can be defined by relations. This general theory can then be applied to any dynamical system comprised of 
comparable nodes, e.g. networks. More practically the wealth of established knowledge concerning brain function offers an 
interdisciplinary approach to theoretical development. Furthermore the theory needs to be tested. In this respect functional 
MRI with high spatial resolution and other brain imaging technologies could be used. For example FMRI has already 
been used as a way of obtaining information about the state of VI that is sufficient for image reconstruction, see ifTOl . 
However due to spatial distortion of the retino-cortical mapping and restricted FMRI voxel resolution, and perhaps other 
factors, it is not possible to recognize viewed stimulus from FMRI images directly. Reconstruction often uses methods 
from linear mathematics and probability where knowledge of the visual stimulus used is necessary during the setup stage. 
Taking the elements of S to be the voxels covering VI it is interesting to know whether typical data would give rise to a 
geometric relationship between the voxels, differing from their FMRI image positions, such that the viewed stimulus would 
be recognizable from the repositioned voxels. Two methods could be tried when establishing the geometry on S. The first 
would follow the theory as presented in Section |3] For the second the distances between the voxels could be obtained from 
the map d : S^ -^ [0, 1], d{a,b) := 1 —RT{a,b). In both cases each relation R{Si) should have numerical cell values since the 
similarity of voxel states can be quantified in the range to 1 . One type of visual stimulus to try would have a single transition 
line placed at random in the field of view. 

4.1 Conclusion 

We have already mentioned in Section [3] that the BCM version of Hebbian theory provides evidence of how the brain itself 
defines typical data. We mentioned the evidence in the case of the primary visual cortex VI but there is also evidence for the 
relevance of BCM theory regarding the hippocampus, see [|1H . In particular the typical data that VI defines should be typical 
of the states induced by signals from the retina. In Section |3] it is shown, at least in theory and up to a good analogy using 
a digital camera, that for appropriate parameters such typical data defines a relation on the set of neurons of VI that gives 
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the perceived geometry for monocular vision. The relation is defined by the typical data by being special in the sense that it 
minimizes the expectation of the float entropy of the system. However our theory is intended to be applied to typical data for 
the whole brain so that such a relation also determines how the states of other sensory regions are perceived. For example the 
relation on the auditory cortex might define how we perceive the relationship between the pitches of the chromatic scale. Of 
course more work is required in order to determine the extent to which this theory can account for how the brain defines the 
various aspects of consciousness. 

However at the higher semantic level it is fairly clear that the typical data for the brain defines relationships between objects in 
the way described in Subsection l3.1l For example a good impressionist painting provides VI with just enough of a particular 
stimulus such that VI produces the same state as that induced by a photograph of the same subject. This ability of the brain 
is widely known as filling-in and shows that typical data defined by the brain will determine a strong relationship between 
certain objects. Furthermore it is well known that certain parts of the thalamus act as a relay between different parts of the 
cortex including different sensory regions. This and other connections can arguably result in the brain defining typical data 
that determines relationships between objects arising from different sensory regions of the cortex, [4] is of relevance here. 
Further the states of the brain during dreaming, visualization with the eyes closed and inner sound are all instances of typical 
data produced by the brain itself independent of the senses at the time. 

We will now turn our attention to what is known as the binding problem. In short the binding problem can be summarized by 
the following observation and question. The visual content of our conscious experience correlates with the state of the visual 
cortex, whereas the sound content of out conscious experience correlates with the state of the auditory cortex. How therefore 
can the state of two quite distinct and spatially separated brain regions give rise to a single unified conscious experience? If 
the theory presented in this paper is correct then the answer is quite straightforward. The content of consciousness is defined 
by the state of the brain interpreted in the context of the relations, such as those discussed above, defined by the brain's typical 
data. The typical data is determined by the brain's structure. Hence consciousness is a property of the brain as opposed to 
being an output of some algorithmic procedure or relying on some homunculus concept. A compact disc on its own is almost 
meaningless but in the context of a sufficiently large CD library it is a specific piece of music, Beethoven for example or 
Mozart perhaps. Similarly a brain state on its own is almost meaningless but in the context of the brain's typical data it is a 
moment of consciousness by which we mean the brain state with the relations defined on it by the typical data and this is for 
example the view of the coffee cup with the sound of the radio and the taste of the coffee all together. 

Finally this paper if correct still leaves many questions unanswered and the lack of an attempt to answer them in the context 
of this initial proposition of the theory is rightful cause for some criticism. Here are a few of these questions: 

(i) Can the theory explain the conscious experience of the color red or does the theory need to be extended? 

(ii) What are the other relations that typical data define? 

(iii) What connections are there, if any, between our theory and the theory of consciousness as integrated information as 
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proposed by Giulio Tononi, see H],? 

(iv) Even though the neurons are an obvious candidate for the elements of the carrier set S are they the right candidate? 

(v) Let Si be the data element for a given brain state. Is all of the relation Rg- contributing to consciousness regarding 5, or 
is only a subset 

Rff{Si) := {{Xj,Yk) : {XjJk) E Rg where bothXy and Y^ embed in to 5,}? (11) 

(vi) Is it useful to also consider a carrier set where the elements are time dependent neurons over a short time interval or 
some discrete version of the same involving short finite sequences? 
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